Context Grabbing: Assigning Metadata in Large Document Collections

نویسندگان

  • Joachim Hinrichs
  • Volkmar Pipek
  • Volker Wulf
چکیده

Classification schemes are an important issue in the collective use of large document collections. We have investigated the classification of technical documentations in two engineering domains: a steel mill and a sewerage plant company. In both cases we found a coexistence of different classification schemes and problems resulting from distributed local archives. In supporting human actors to maintain different classifications schemes while working on a common archive, we developed the concept of context grabbing. It allows assigning context information efficiently in the form of metadata. Based on a document management system, a tool kit for context grabbing was developed. Its evaluation in a sewerage service company allows us to comment on important aspects of understanding the role of classifications in collaborative work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Jyväskylä Studies in Computing 37 Contextual and Structural Metadata in Enterprise Document Management Contextual and Structural Metadata in Enterprise Document Management Jyväskylä Studies in Computing 37 Contextual and Structural Metadata in Enterprise Document Management University of Jyväskylä

Lyytikäinen, Virpi Contextual and structural metadata in enterprise document management Documents have a central role in organizations. While the amount of information continually increases, new kinds of methods for managing the documents are needed. Enterprise document management concerns the whole life cycle of documents in organizations, from emergence to disposition, and also development of...

متن کامل

Content-based document image retrieval in complex document collections

We address the problem of content-based image retrieval in the context of complex document images. Complex document are documents that typically start out on paper and are then electronically scanned. These documents have rich internal structure and might only be available in image form. Additionally, they may have been produced by a combination of printing technologies (or by handwriting); and...

متن کامل

Visual Topic Maps Layer between Document Collections and Learning Material

This paper introduces a semantic layer, called visual topic maps, to build a bridge between annotated document collections and the use of these documents as learning material. The main components of a visual classification are metadata-based topic maps attached to documents that allow customization according to users’ needs and profiles. The metadata of documents and visual topic maps are based...

متن کامل

Automated Template-Based Metadata Extraction Architecture

This paper describes our efforts to develop a toolset and process for automated metadata extraction from large, diverse, and evolving document collections. A number of federal agencies, universities, laboratories, and companies are placing their collections online and making them searchable via metadata fields such as author, title, and publishing organization. Manually creating metadata for a ...

متن کامل

The effect of assigning a metadata or indexing term on document ordering

The assignment of indexing terms and metadata to documents, data, and other information representations is considered useful, but the utility of including a single term is seldom discussed. We discuss a simple model of document ordering and then show how assigning index and metadata labels improves or decreases retrieval performance. The Indexing and Metadata Advantage (IMA) factor measures how...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005